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Abstract. The handling of units in an automated way by software systems can be a cumbersome procedure 
when the units are parsed as strings. Software systems parsing units have to take into account extensive tables 
of unit names, not always identical between different standards. In the Virtual Observatory context, the transfer 
of metadata in access protocols is specially critical for the understanding of the content of the data (many of 
which are legacy data). Driven by this issue, we present a way to handle units automatically which is based in 
dimensional analysis considerations. Although the approach presented here can be valid to any other branch of 
physics, we concentrate mainly in the application to spectrum handling in the VO context, and show how the 
proposed solution has been implemented in the VOSpec, a tool to handle VO-compatible spectra. 
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1. Introduction 

' Handling of units in spectra can be cumbersome when 
, dealing with different Flux energy densities. For in- 
' stance, the conversion between F\ and is nor- 
, mally done by parsing the unit name strings. Despite 
several efforts to homogenize unit name strings (See 



Greisen et al.(2004)| [Taylor (1995)|[George et al. (1995) I 



several different standards do exist, forcing the parsing- 
string mechanism to stick to one or several of those stan- 
dards. On top of this, the adaptation of already existing 
legacy data to one or other standard might be a cumber- 
some -and even sometimes impossible- task. 

The aforementioned problems led us to try and fig- 
ure out a way to automatically handle units without the 
necessity to parse strings. Using dimensional analysis, we 
have devised an algorithmic way to convert between di- 
mensionally homogeneous quantities. As a simple exam- 
ple, the string-parsing algorithm needed to convert be- 
tween W/cni? / and ergjcrr? j sj A (both F\) would be 
substituted for an algorithm to go from VS^'^ML~^T~^ to 
VS'ML-^T-'^, i.e., dividing by a factor of 10^. 

In section 2 we give a general mathematical formalism 
of the relevant parts of Dimensional Analysis techniques 
that will be needed for the handling of this problem. In this 
section we will follow with very slight modifications the ex- 



cellent book by Szirtes(1997) We also describe the han- 
dling of the dimensional matrix to unveil how to extract 
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dimensional relations between different units on the same 
problem. In section 3 we apply the theory to the case of Fx 
and Ft, conversions. And in section 4, we give a general 
algorithmic method for the conversion between different 
unit systems for the case of spectra together with details 
about the use of this technique in the VOSpec, a tool to 
handle SSAP (Simple Spectrum Access Protocol) compat- 
ible spectra developed at the European Space Astronomy 
Centre (ESAC) of the European Space Agency. 

2. Dimensional Analysis overview 

Dimensional analysis helps in the understanding of certain 
problems for which no analytic mathematical formulation 
exists, or for which the mathematical formulation is too 
complex. In these cases, the dimensional analysis allows 
to extract certain conclusions about the behavior of the 
system without the need of specific mathematical formulae 
relating the different variables of the problem at hand. 

In physics we deal with quantities which have certain 
dimensions. These are combinations of a reduced number 
of basic or fundamental dimensions. These fundamental 
dimensions form a dimensional system. Dimensional sys- 
tems can range from mono-dimensional (only a fundamen- 
tal dimension is used to represent any physical quantity) 
to omni-dimensional (all dimensions are fundamental di- 
mensions) (See ref |Szirtes(1997)_ for examples). The inter- 
mediate, and mostly used, is the multidimensional system, 
in which a reduced set of fundamental dimensions is used. 
In the examples that will follow, we will adhere to the 
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most widely used system of MASS-LENGTH-TIME sys- 
tem with the addition of Temperature, Electric current 
and luminosity to deal with other more complex prob- 
lems. Following a long tradition in dimensional analysis 
(see |Maxw cU(1890)]l wiU cah M-L-T these fundamental di- 
mensions. Following this convention, a physical quantity, 
e.g., a Force, would be represented as follows: 
[F] = H [a] = MLT-^ 

where square brackets should be read as "dimensions 
of and we will call the rightmost part the "dimensional 
equation" of the quantity under consideration. 

2.1. Dimensional Matrix 

Let F{Vi,V2, ...,Vn) = be a physical relation among a 
set of variable quantities Vi . The dimensional equations of 
the different variables will be: 



where: 



[Vi] 

[V2] = ^d: 



'"22 (^"2n 



The matrix formed with the exponents of the dimen- 
sions is called the Dimensional Matrix: 



D 



ail ai2 

OL2I Oi22 



Oi2n 



(1) 



From the dimensional matrix, we can now tackle the 
following problem: how can we find dimensional (or di- 
mensionless) products of the variables at hand?. In other 
words, how do we find the e\,£2T--T£n exponents that 
solve the following equation: 



dfdf. 



(2) 



E 



A : A 



and A is a nonsingular square matrix of size the rank 
of the dimensional matrix, and B is formed by the rest of 
columns included in D and not included in A. The sizes 
of the different matrices are: 



(5) 



the column matrix rjk is formed by £n-p arbitrary 
exponents followed by qp sought dimensional exponents. 
The first e„_p are arbitrary due to the fact that Rd = 
ranfc(D) — p and therefore out of the n exponents we can 
only determine p, leaving n-p arbitrary. Therefore, rjk is: 



£1 
£2 



E 


in) 


X (n) 




square matrix 


I 


(n - 


-p) X 


(n-p) 


identity matrix 





(n - 


-p)x 


(P) 


null matrix 


A 


ip) 


X (P) 




square matrix 


B 


ip) 


X {n - 


'P) 


matrix 


A ^B 


(P) 


X (ri - 


'P) 


matrix 



^n—p 

qi 

92 



The problem is thus reduced to finding the matrix P 
result of the product: 



P — 



E 

k=l 



Eik Zkj 



(6) 



where Eik is as defined earlier and Zkj is the result of 
superimposing the rjk columns for the different variables: 



where: Vi,...,Vn are the variables of the problem, 
di, ...,dp are the fundamental dimensions of the problem, 
qi, ...,qp are the given exponents (sought combinations of 
fundamental dimensions; to be set equal to zero for di- 
mensionless products). 

The problem is therefore reduced to solving for ej the 
following system of linear equations: 



E 



Dij Sj 



(3) 



It can be shown that (see Szirtes(1997) I the solution 
to the above is: 



n 

E 

fc=i 



Ejk rjk 



(4) 



£11 
£21 



£12 
£22 



£(ri-p)l £(n-p)2 

qi qi 

<72 92 



qp 



qp 



£lp 

£2p 

£{n—p)p 

qi 

92 

qp 



(7) 



both P and Z are of size nxp. 

In the particular case where we are seeking for dimen- 
sionless products, all the qi will be set to zero (dimen- 
sional exponents) as a dimensionless quantity is consid- 
ered to have dimension=l (for a dissertation on the conve- 
nience to call dimensionless these type of quantities, check 



Szirtes(1997)l. 
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2.2. Complete set of dimensional products: 
Buckingham theorem 

The so called Buckingham's theorem reads as follows: 
The number of independent dimensional products 
which can be composed for a given number of variables 
and dimensions is: 

= n — R]j for dimensionless products 

n.^ = n — R]j + 1 for dimensional products 

where n is the number of variables and Rdm is the 
rank of the dimensional matrix. 

In this case, the sizes of the matrices at would be: 



(8) 



An obvious advantage of finding the dimensionless 
products of the problem under investigation is that it 
reduces the number of parameters relevant to the sys- 
tem. In the case that the precise mathematical formula 
governing the behavior of the system is known, e.g., the 
Navier-Stokes equation for a fluid system, then the di- 
mensionless products give information as well on the rel- 
ative importance of each of the terms in the equation, 
helping in the simplification of the equations for certain 
combinations of the dimensionless products (for a beau- 
tiful example regarding the Navier-Stokes equations, see 
Diez-Roche(T980)l). 



I 


in- 


Rd) X 


{n - Rd) 


identity matrix 





{n — 


Rd) X 


(Rd) 


null matrix 


A 


{Rd 


) X {Rc 


) 


square matrix 


B 


{Rd 


) X (n - 


-Rd) 


matrix 


E 


{n) X {n) 




square matrix 


A ^B 


{Rd 


) X (ri - 


-Rd) 


matrix 



3. Application to Spectral Fluxes 

Using the dimensional analysis principles, we will extract 
the dimensionless products for flux densities. These prod- 
ucts will be used later, in section 4, to achieve the unit 
conversion between flux densities in different units. 



3.1. Fx versus A 

To construct the dimensional matrix for F\ we will con- 
sider a spectral energy distribution of the form: 



Fx{c,X,h) = 



(9) 



according to previous sections, we will have the follow- 
ing set of matrices: 



A = 



1 

1 1 2 

-f -1 



B 



A^B = 



-2 
5 
-1 



E 



1 


1 


1 


1 







































P = E* Z = 



10 

-2 -1 -1 
5-11 1 
-110 



1111 

-2 -2 -2 -2 
5 5 5 5 
-1 -1 -1 -1 



where following the general practice (see Szirtes(1997) i 
we have chosen the matrix A to be composed of the inde- 
pendent variables in relation and he matrix P gives 
the exponents(columns) on the dimensions (rows) that 
give rise to dimensionless products. According to the the- 
ory before, the number of dimensionless products in this 
case would be n — Rdm, i-e., 4-3=1, hence the fact that 
three of the columns are linear combinations of the re- 
maining one (identical, in this case) in the P matrix only 
giving result to one dimensionless product combination: 



1111 

-2 -2 -2 -2 
5 5 5 5 
-1 -1 -1 -1 



where rows identify the variables in (|10|l and columns 
correspond to the exponents of those variables. 

Therefore, the unique dimensional product thus for 
this case would be: 



7T = F), 



c2/i 



(12) 



3.2. Fy versus A 



Following exactly the same procedure as before, we would 
have: 



The dimensional equations of the different members of 
the previous relation are: 

[Fa] = ML-^T-^, [c] = LT-\ [A] = L, [h] = ML^T-\lO) 
we now construct the following table: 





F, 


c 


A 


h 


M 


1 








1 


L 





1 


1 


2 


T 


-2 


-1 





-1 



(13) 



and therefore: 





Fx 


c 


A 


h 


M 


1 








1 


L 


-1 


1 


1 


2 


T 


-3 


-1 





-1 



(11) 



A = 



1 

1 1 2 

-1 -1 



B 



4 



Osuna-Salgado; Dimensional Analysis and VO Spectra 



E 



1111 






E * Z 



10 

-1 -1 -1 
3-11 1 
-110 



1111 

-1 -1 -1 -1 
3 3 3 3 
-1 -1 -1 -1 



Conversions in the x-axis to velocity space do need a 
central reference wavelength (frequency) which will have 
to accompany the metadata for the data. Once this value 
is known, the transformation of the x-axis will just consist 
of a translation plus a dilation, and the Y-axis will be kept 
as is. In the case that the data are coming with velocity in 
the X-axis, they will have to contain the reference lambda 
giving rise to those velocity values, and the process to 
convert the data to wavelength values would be simply 
inverted. 



And therefore the only dimcnsionlcss product would 4. Algorithmic approach and use in VOSpec 



be: 



ch 



(14) 



Both dimcnsionlcss quantities in (|12() and (|14|l are de- 
scriptions of the same physical problem. As they are lineal 
in the dependent variable {F\ and F,y respectively) we are 
able to conclude that they will be equivalent, i.e., that we 
can write in this case: 



TT = TT 

and therefore: 

— = 

c^h ch 



Fx - -^F^ 



(15) 



which is the physical result expected when transform- 
ing flux densities. 

The dimcnsionlcss product obtained when repeating 
the above procedure for F^ and Fx with v as the indepen- 
dent variable give: 



F, 



and 



TT = Fx—r-r respectively 



This working example shows how the dimensional 
analysis can be used in the handling of spectrum fluxes 
and unit transformation. Much more complex problems 
can be tackled using this approach. For a nice compila- 
tion of literally hundreds of examples, consult the book 



by Szirtes(1997) 



In what follows, we give an algorithm to bring the 
aforementioned ideas to practice when designing a client 
tool to handle Virtual Observatory spectra. 

3.3. Comment on apparent velocity as X-axis spectral 
coordinate 



In the paper by |Greisen et al.(2004)| the apparent radial 
velocity is considered as one of the possible spectral coor- 
dinates. A whole set of possible transformations between 
the different spectral coordinates ("x-axes") is given to- 
gether with their derivatives. The paper deals spectral 
coordinate transformations, rather than with flux trans- 
formations (on the "y-axis"). In this work, we deal with 
energy densities which depend, generally, on A or v. 



In the Virtual Observatory context, there is a need to 
have a standard protocol to make spectra accessible from 
different projects in a simple way. The idea is to cre- 
ate a protocol for spectra in the same line as the al- 
ready standard Simple Image Access Protocol (SIAP), 
that would allow for the creation of on-the-fly Spectral 
Energy Distributions (SEDs) from heterogeneous data 
sets. 

This protocol has been called Simple Spectrum Access 
Protocol (SSAP) and basically implements a two-step pro- 
cess: 

— In the first step, a cone search is done on available 
services and the match results are sent, together with 
metadata, in a VO standard VOTable. 

— In the second step, the pointers to the real data files 
(spectra) are called and data are retrieved. 

The main problem faced when trying to create an SED 
using data from different projects/formats is to compare 
data different units. We should be able to transform spec- 
tral coordinate units and fluxes to a common unit system. 

Our proposed solution is to specify in the metadata 
(first SSAP step) for every spectrum, the dimensional 
equation for the spectrum axes, and use dimensional anal- 
ysis to extract the conversion formulae needed to go from 
one to the other. 

To prove that this on-thc-fly conversion was possible, 
we developed a tool called VOSpec, able to request differ- 
ent SSAP servers and produce a common SED from dif- 
ferent spectra in different formats from different projects. 

The application is already available to the general pub- 
lic at: http://esavo.esa.int/vospec/ and it has been 
used for the AVO demo which took place on Jan 25 & 26, 
2005 at ESAC. See|AVO Demo(2005)|for reference. 



At the time of writing this paper, spectra from ISO, 
lUE, HST (FOS), SDSS, HyperLeda and FUSE 

projects are already providing SSAP access to their data. 
All the spectra from these services can be superimposed 
in the same display, and the user can generate on-the-fly 
SEDs as can be seen in Figure Spectra from projects 
that don't have SSAP services, can be loaded locally using 
the SSAP Wrapper Creator integrated in VOSpec. 



Osuna-Salgado: Dimensional Analysis and VO Spectra 



5 






Title 
E [_ 


Ra 


C 


F 

1- 


B 1 S 




INES-.The 1... 
INES. Tlie 1... 


E t£. t 
E 1 £. 
E E. 






t- t tt I 


□ [ ^ 



Fig. 1. SED generated for the quasar 3c273 using VOSpec, covering from Radio to X-Ray spectral data. From left to 
right: Radio and sub-mihmiter points (local files loaded data taken from the Geneve University database). Infrared 
spectrum data from the ISO SSAP service (LWS and PHOT data). Optical and ultraviolet spectrum data from FUSE 
(SSAP service) and HST(FOS) (SSAP service). On the right, X-Ray data from XMM-Newton (processed local file) 



4.1. Algorithmic approacli 

There are several ways to describe a spectrum flux and the 
spectral coordinate inside a 1-D spectrum. A conversion 
table could be used to make the transformations accord- 
ingly, but it is not easy to use in an automatic algorithm, 
and this would limit the number of possible transforma- 
tions allowed by the system. 

In this section, we will describe an algorithmic way to 
approach the units problem from the dimensional point of 
view and show how this is used in the VOSpec application. 

A unit can be described in the following way: 

[UNIT] = Scaling * WL^T" (16) 

where the scaling factor is defined with respect to a certain 
common system of units and the exponents a,b,c define 
the unit dimcnsionally. Wc will choose the SI as our base 
dimensional system of reference. 

In order to understand how the algorithm works, sup- 
pose we arc dealing with a spectrum in Jansky (y-axis) 
and Hertz (x-axis). The units dimensions (and scale fac- 
tors) turn out to be: 



Suppose now, we want to convert one spectrum point 
defined by the pair [v{Hz),Fv{Jy)\ to other ones, e.g. 

M/^"^)'^A(^)],i.e.: 



W 



First we need to generate the matrix E for the original 
system, as we saw in section 3: 







c 


V 


h 


M 


1 








1 


L 





1 





2 


T 


-2 


-1 


-1 


-1 



(17) 



so, in this case: 



1 

2 
-1 -1 



-2 

1 
1 



1 

-1 -1 




and constructing B from the Flux density units: 



v{Hz) 

Fu{Jy) 



T- 
10" 



26 



MT- 



B = 



-A-^B 



2 
-3 
-1 



where 1 and 10 are the reference scaling factors to so finally, constructing the matrix E using the rules 
the SI units system for Hz and Jy respectively. described in section 3: 
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E = 



10 
2-210 

-3 1 -1 -1 
-110 



Now, to go from one system to the other, we have to 
generate two different Z matrices, one for the spectrum 
coordinate transformation and the second one for the flux 
transformation. As we are looking for certain target di- 
mensions, the Z matrix will have to include the sought 
dimensional exponents, and therefore, as the final spec- 
trum coordinates are fim (L) we will have: 



To finalize the transformation, we must include the 
scaling factors of the different units. To achieve this goal, 
we note that every time a magnitude is used, the scaling 
must appear, i.e.. 



5, 



%m)A(/UTO) 
W 



)Fx(- 



) = S^jy^F^Jy) 



Where the S elements correspond to the scalings with 
respect to a common system of reference units (in this 
case SI). 



M 
L 
T 



where the first zero is imposing no dependence in the 
flux, and the rest are the exponents in M, L, T respec- 
tively. 

For the final system of units, the Fx is 

{ML-^T-^y. 



w 



1 


F. 


1 


M 


-1 


L 


-3 


T 



where, as it was defined in previous section, the first 
element is the dependence in the flux and the rest of the 
elements in the Z are the dimensional equation exponents 
for M,L and T respectively. 

If we multiply then the original E matrix with these Z 
vectors we obtain for the spectral coordinate: 



Pi =E*Zi 



F. 
c 

V 

h 



and for the flux: 



P2 = E * Z2 = 



c 
u 
h 



That means, respectively: 



[A] = 



[c] 



W 



Finally we obtain: 



1 



Fx{- 



W 



-) = 



c(m/s) 

S(pm)-S(Hz) v{Hz) 

c{mls) 
■ v{Hz) 



10- 



26 



-) 



F.{Jy) 



u^jHz) 
c{m/s) 



-F.iJy) 



1010 
W-^^F^{Jy) 



c{m/s) 
c{m/s) 



These final formulae tell us how to express the values 
of a [x, y] point in the final units as a function of the point 
in the original ones. 

The algorithm can be summarized as follows: 

1. Construct the matrix A, using the spectral coordinate, 
c and h dimensional equations. 

2. Construct the vector B using the flux density dimen- 
sional equation. 

3. Invert the matrix A, and construct the matrix E as it 
was described in point 1. 

4. Construct two Z vectors using spectral coordinate and 
flux density dimensional equations of the final units. 

5. Multiply the matrix E with the two Z vectors to obtain 
the conversion factors. 

6. Finally, use the scaling factors to finish the conversion. 

5. Conclusions 

We have shown how to make use of Dimensional Analysis 
techniques to handle unit conversion in an automated way 
for the case of spectral fiux densities. 

We have proposed the IVOA to include the SCALEQ 
and DIMEQ parameters as part of the Simple Spectrum 
Access Protocol so that clients can use Dimensional 
Analysis algorithms to handle units automatically. 

The approach shown is not only relevant to the 
Spectral access within the VO, and can be extended -as 
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mentioned in the introduction- to any physical problem 
by including the relevant dimensions. 

In this sense, we imagine any unit as composed of three 

main attributes: 

1. A name (and possibly a symbol) 

2. A SCALEQ (giving the Scaling of the unit with respect 
to the International System of Units) 

3. A DIMEQ (giving the dimensions of the unit) 

An example of a serialization of this units model could 
be the FITS representation, in which a unit like the 
" Jansky" could be represented in the following way: 



CUNIT =' Jansky' 
C SCALEQ =' IE - 26' 
C DIMEQ =' MT * * - 2' 

Certainly, to cover all possible physical units, more di- 
mensions than M-L-T would be needed, but the set will 
always be far smaller than the units they can represent. 
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